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Cities and their transportation systems become increasingly complex and mul¬ 
timodal as they grow, and it is natural to wonder if it is possible to quantita¬ 
tively characterize our difficulty to navigate in them and whether such navi¬ 
gation exceeds our cognitive limits. A transition between different searching 
strategies for navigating in metropolitan maps has been observed for large, 
complex metropolitan networks. This evidence suggests the existence of an¬ 
other limit associated to the cognitive overload and caused by large amounts 
of information to process. In this light, we analyzed the world’s 15 largest 
metropolitan networks and estimated the information limit for determining a 
trip in a transportation system to be on the order of 8 bits. Similar to the “Dun¬ 
bar number,” which represents a limit to the size of an individual’s friendship 
circle, our cognitive limit suggests that maps should not consist of more than 
about 250 connections points to be easily readable. We also show that includ¬ 
ing connections with other transportation modes dramatically increases the in¬ 
formation needed to navigate in multilayer transportation networks: in large 
cities such as New York, Paris, and Tokyo, more than 80% of trips are above 
the 8-bit limit. Multimodal transportation systems in large cities have thus al¬ 
ready exceeded human cognitive limits and consequently the traditional view 
of navigation in cities has to be revised substantially. 
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Introduction 


The number of “megacities”—urban areas whose human population is larger than 10 million— 
has tripled since 1990 ( 1 ). New York City, one of the first megacities, reached that level in the 
1950s, and the world now has almost 30 megacities, which together include roughly half a bil¬ 
lion inhabitants. The growth of such large urban areas usually also includes the development of 
transportation infrastructure and an increase in the number and the use of different transporta¬ 
tion modes (2). For example, about 80% of cities with populations larger than 5 millions have a 
subway system (3). This leads to a natural question: Is navigating transportation systems in very 
large cities too difficult for humans (4)? Additionally, how does one quantitatively characterize 
this difficulty? 

It has long been recognized that humans have intrinsic cognitive limits for processing in¬ 
formation (5). In particular, it has been suggested that an individual can maintain only on the 
order of 150 stable relationships ( 6 , 7). This “Dunbar number” was first proposed in the 1990s 
by British anthropologist Robin Dunbar by extrapolating results about the correlation of brain 
sizes and the typical sizes of social groups for various primates. Although it is still controver¬ 
sial, it has been supported by subsequent studies on, e.g., traditional human societies (3) and 
microblogging (9). 

When navigating for the first time between two unfamiliar places and having a transportation 
map as one’s only support, a traveler has to compare different path options to find an optimal 
route. Differently from the case of the schematization of partially familiar routes ( 10 ), here the 
traveler does not need to simultaneously visualize the whole route; it is sufficient to identify and 
keep track of the position of the connecting stations on the map. Therefore, a first important 
point to consider is that humans can track information on a maximum of about four objects in 
their visual working memory ( 11 ). This implies that a person can easily keep in mind the key 
locations (origin, destination, and connection points) for trips with no more than two connec¬ 
tions (which corresponds exactly to four different points). In addition, recent studies on visual 
search strategies ( 12 , 13 ) show a transition in search strategies between the simple cases of the 
Stuttgart and Hong Kong metropolitan networks and the case of Paris, which has one of the most 
complicated transportation networks in the world. The time needed to find a route in a trans¬ 
portation network grows with the complexity of its map, and the pattern of eye fixations also 
changes from following metro lines to a random scattering of eye focus all over the map ( 12 ). 
A similar transition from directional to isotropic random search has been observed for visual 
search of hidden objects when increasing the number of distractors ( 14 ). The ability to man¬ 
age complex “mental maps” is thus limited, and only extensive training on spatial navigation 
can push this limit with morphological changes in the Hippocampus ( 15 ). Human-constructed 
environments have far exceeded these limits, and it is interesting to ask whether there is a navi¬ 
gation analog of the Dunbar number and a cognitive limit to human navigation ability, such that 
it becomes necessary to rely on artificial systems to navigate in transportation systems in large 
cities. If such a cognitive limit exists, what is it? In this paper, we answer this question using 
an information perspective (5) to characterize the difficulty to navigate in urban transportation 
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networks. We use a measure of “information seareh” assoeiated to a trip that goes from one 
route to another {16). In most networks, many different paths eonneet a pair of nodes, and one 
generally seeks the fastest path that minimizes the total time to reaeh a destination. However, it 
tends to be more natural for most individuals to instead eonsider a “simplest” path, whieh has 
the minimum number of eonneetions {18) (see Fig.[^. 
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Figure 1: Fastest and Simplest paths in Primal and Dual networks. (Left) In the primal 
network of the New York City metropolitan system, the simplest path (highlighted in light blue) 
from 125th St. on line 5 (dark green) to 121st St. on line J (brown) differs signifieantly from the 
fastest path (highlighted in grey). There is only one eonneetion for the simplest path {Brook¬ 
lyn Bridge - City Hall / Chambers Street) in Lower Manhattan. In eontrast, the fastest path 
needs three eonneetions (5—^-F—>■£—>■ J). We eompute the fastest path using travel times from the 
Metropolitan Transportation Authority (MTA) Data Feeds (see Materials and Methods). We 
negleet walking and waiting times. (Right) In the dual spaee, nodes represent routes and edges 
represent eonneetions. A simplest path in the primal spaee is defined as a shortest path with the 
minimal number of edges in the dual spaee (light blue arrow). It has a length of C = 1 and 
oeeurs along the direet eonneetion between line 5 (dark green node) and line J (brown node). 
The fastest path in the primal spaee has a length of C = 3 (grey arrows) in the dual spaee, as 
one has to ehange lines three times. (We extraeted the NYC metro sehematie from a map that 
is publiely available on Wikimedia Commons (55).) 

Rosvall et al. {16) proposed a measure for the information that is needed to eneode a shortest 
path from a route s to another route t. However, the amount of neeessary information ean 
depend strongly on the initial and final nodes, and we eonsider a trip from an origin node i in 
route s to a destination node j in route t. This trip is embedded in real spaee and among all 
possible simplest paths {16) (whieh need not to be unique), we piek the fastest one p{i, s]j, t). 
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which can differ from an actual fastest path between i and j (see the left panel of Fig. [^. 
For computing the travel time of a trip, we neglect the eontribution of walking and waiting 
times (2). However, the ehoiee of a simplest path already tends towards minimization of such 
transfer costs, which strongly influence a traveler’s deeisions (77). 

The total information for knowing the fastest simplest path is 

^(*,s;j,f) = -iog2 n 

y * nep{i,s-,j,t) y 

where p{i, s; j, t) is the sequenee of routes needed for eonnecting i in route s to j in route t. The 
term kg is the number of routes eonnected to s, and along the path, we have the ehoiee between 
kn — l routes. The idea behind Eq. Q is that when tracking a trip along a map (with the eyes or a 
finger), the eonneetions that one has to exelude represent—similarly to the number of distraetors 
in visual seareh tasks (14 )—the information that has to be proeessed and thus temporarily stored 
into working memory (19). One ean therefore eonstruet the measure of entropy O as a proxy 
for the aeeumulated eognitive load that is assoeiated to the trip, and it is analogous to the total 
amount of load experieneed during a task (20). For this reason, this measure of entropy seems 
to be appropriate for estimating the information limit assoeiated to the observed transition in 
the visual search strategy (12,14). 

From a map user’s perspeetive, the existenee of several alternative simplest paths is not 
necessarily a significant factor, as one only needs a single simplest path for sueeessful trans¬ 
portation from origin to destination. Consequently, we use the entropy in Eq. Q rather than the 
one proposed in Ref. (16). (See the Materials and Methods for further diseussion.) To produee 
a single summary statistie for a path, we average S(i, s]j, t) over all nodes i e s and j G f (we 
denote this mean using brackets (■)) to obtain 

S(s,t) = {S(i,s;j,t)), (2) 

whieh is the main quantity that we use to deseribe the eomplexity of a trip (see Eig. 5) and 
whieh will allow us to extraet an empirieal upper limit to the information that a human is able 
to proeess for navigating. 


Results 

We use the measure S(s, t) to eharaeterize the eomplexity of the 15 largest urban metropolitan 
systems in the world. Erom these results, we then extraet an empirieal upper limit of about 8 bits 
to the information that a human is able to proeess for navigating. We then apply this threshold 
in ealeulations for multimodal transportation networks and demonstrate that most trips in large 
eities exeeed human eognitive limits. 


4 



1.0 


CN 

II 

o 


0.8 


0.0 


0.4 


0.2 


0.0 



NYC 

Paris 

Tokyo 

Barcelona 

Madrid 

Seoul 

Beijing 

Hong Kong 

Mexico City 

Osaka 

Shanghai 

Berlin 

London 

Moscow 

Chicago 

Paris MRT 

8.1 bits 


0 2 4 0 8 10 12 

Sis, t I C = 2) (bits) 



Figure 2: (Left) Information threshold. Cumulative distribution of the information needed to 
eneode trips with two eonneetions in the 15 largest metro networks. The largest value oeeurs for 
the New York City metro system (solid red eurve), whieh has trips with a maximum of Smax ~ 
8.1 bits. Among the 15 networks, the Hong Kong (dashed, red) and Beijing (solid, blaek) 
metro networks have the smallest number of total eonneetions and need the smallest amount of 
information for navigation. The Paris MRT (Metro, Light Rail, and Tramway) network (orange, 
dash-dotted) from the official metro map (32). includes three transportation modes (which are 
managed by two different companies) and reaches values that are similar to those in the larger 
NYC Metro. (Right) Information threshold versus total number of connections in the dual 
space. This plot illustrates that the mean amount of information that is needed to encode trips 
with two connections is strongly correlated with the total number of connections in the dual 
network, as can be predicted for a square lattice (see Materials and Methods). See Table 1 for the 
definitions of the abbreviations. The color code is the same as in the left panel, and the red solid 
line represents the square-lattice result ^(s, flC = 2) = log 2 (iFtot)- This relationship permits 
one to associate the information threshold Smax with the cognitive threshold T = which 
one can interpret as a maximum number T of intra-route connections that can be represented 
on a map. 

Information Threshold 

The values of S(s,t) in a network tend to grow with the number C of connections that appear 
in a simplest path as well as with the meanmean degree (k) of the nodes in the dual space (see 
Figs. 6 and 7). Note that the latter is related to the total number of connections in a network. 
Adding new routes can thus have a negative impact from the information perspective. Although 
new routes can be useful for shortening the simplest paths for some (s, t) pairs, new connections 
simultaneously increase the mean degree of a network and can make it more difficult to navigate 
in a network. We thus want to estimate the maximum possible information that an individual 
can reasonably process to navigate in a transportation system. For that purpose, we consider 
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the world’s 15 metro networks with the largest number of stations. The eharaeteristies of metro 
networks were examined in previous papers {3, 21-26), and navigation strategies have been 
eonsidered in transportation networks (27-29). For eaeh network, we eonsider the shortest 
simplest paths with C = 2 eonneetions. This eorresponds to paths that use 3 different lines: 
sueh a path starts from a souree route s, eonneets to an intermediate route r, and then eonneets 
to a destination route t. There are two distinet reasons for this ehoiee: (i) the limit of four objeets 
in the visual working memory (11)', (ii) in most of the 15 eities, two eonneetions eorrespond 
to the diameter of the dual network. From a map user’s perspeetive, after having eheeked that 
eaeh pair of eonseeutive stations are eonneeted by a direet line, the loeations to keep in mind 
are the origin, the destination, and eonneeting stations. These nodes eorrespond to the plaees 
that one has to “highlight” on the map to reeord the trajeetory. The eapaeity of visual working 
memory thus allows one to easily keep in mind only trajeetories with 2 eonneetions, leading to 
a total of 4 stations. Interestingly, the value 2 is also the diameter of the dual network in most of 
the metropolitan systems. In sueh situations, all pair of nodes ean be eonneeted by paths with 
at most 2 eonneetions, staying below the working memory limit of humans. 

In the left panel of Fig.we show the eumulative distribution of entropies S(s, t\C = 2) for 
these 2-eonneetions paths. We find that the New York City metro system is the largest and most 
eomplex metropolitan system in the world; it has a maximal value of 8.1 ~ log2(274) 

bits. Paris’ transportation system reaehes a similar value if one takes into aeeount the light rail 
and tram system in the multilayer Metro-Rail-Tramway (MRT) network displayed in the offieial 
metro map (32). 

Navigation in sueh large networks is already nontrivial (13), and it has been observed that 
there is an eye-movement behavioral transition when the system beeomes too large (i.e., when 
there are too many eonneetions) (12). The value S'max for trips with two eonneetions thus 
provides a natural limit, above whieh human eognitive eapabilities are ehallenged and for whieh 
it beeomes extremely diffieult to find a simplest path. We thus make the reasonable ehoiee to 
take Smax as the eognitive limit for publie transportation: a human needs an information entropy 
of S(s,t) < S'max to be able to navigate in a network sueeessfully without assistanee from 
information teehnology tools. 

To gain a physieal understanding for the eognitive limit S'max, we estimate S{s,t\C = 2) 
for a regular lattiee (like the one in Fig.|^ with N lines that are eaeh eonneeted with N/2 other 
lines (i.e., kr = iV/2 for all r). This ehoiee of a lattiee is justified by the results in Ref. (3) that 
most large metropolitan transportation networks eonsist of a eore set of nodes with branehes that 
radiate from it. The eore is rather dense and has a peaked degree distribution, so it is reasonable 
to use a regular lattiee for eomparison. In the dual spaee of the regular lattiee, the degree kg of 
route s is equal to N/2, and we thus obtain 


S(s,f|C = 2) = \og 2 [ks(kr - 1)] 



( 3 ) 
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where {k) denotes the mean degree. The last equality in Eq. Q comes from the relation for the 
total degree of a regular lattice: = {k)N = 2{k)‘^. The key quantity for understanding 

5'max is therefore the total number of undirected connections iTtot = i’^ the dual 

space (also see Table 1). As we indicated in Eq. 0. this is identical to the square of the 
mean degree in a lattice. Eor Paris, for example, we obtain {k) 9.75, which leads to 

9.75^ ~ 95 connections for the corresponding lattice. The actual Paris metropolitan network 
has a total of 78 connections, and the difference comes from the fact that the real network is 
not a perfectly regular lattice. At this stage, it is important to make two remarks. Eirst, the 
apparently paradoxical fact that the total information grows with size for regular lattices while 
intuitively the complexity for finding a path stays constant is specific to the case in which there 
is an “algorithm” to find the route. Indeed, in perfectly regular rectangular (i.e., Cartesian) grids, 
one needs to make at most two turns to find a desired route. Second, drawing a parallel with 
an individual navigating a bifurcating tree from the root to a terminal node who is traversing 
8 bifurcation points (and thus 255 internal nodes), the value of 8 bits as a limit appears to be 
consistent with Miller’s “magic number” {5,30). “Miller’s law” limits also the number of binary 
decisions that can be memorized in sequence to a value that has been observed to be in the range 
7 ±2. 


E F G H 




Eigure 3: (Left) Primal and (right) dual networks for a square lattice. In this example, the 
lattice has N = 8 routes. Each route has k = N/2 = A connections, so the total number of 
connections is if tot = = 16. In the dual network, the four East-West routes (A,B,C,D) and 

the four North-South routes (E,E,G,H) yield a graph with a diameter of 2. 

We test Eq. Q in the right panel of Eig. [^for the 15 largest metropolitan networks. Our 
calculation shows that the total degree in the dual space, which is related to the total number 
of connections in a network, is the main ingredient for understanding the information entropy 
of these systems. We also test this relation for the temporal evolution of the Paris metro net¬ 
work and we show that the number of connections scales as (iV/2)^ for the historical 

growth from 77 = 1 to 77 = 14 routes (see Pig. 8). Equation Q allows us to translate the 
information limit of 8 bits to a limit on the number T of intra-route connections {S = log 2 (T)). 
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The value of T also eorresponds to the number of distraetors to be exeluded for the most eom- 
plex trips (with C{s, t) = 2). This proeess of exelusion thus demands progressive information 
integration, whieh eauses a eognitive overload. 

Evidenee for the existenee of sueh a eognitive threshold is the ehange in seareh strategy 
observed in eye-traeking experiments (12,14). The value T 250 represents the worst-ease 
seenario in the world’s largest metropolitan network. It thus overestimates the values at whieh 
the transition oeeurs. Indeed, the Paris MRT network, for whieh the strategy ehange was ob¬ 
served in (72), has K^ot ~ 162. . It is interesting that T has a similar order of magnitude as 
the Dunbar number, an extensively studied eognitive limit for the size of a friendship’s eirele, 
and whieh seems to lie in a range between 100 and 200. (See, for example, (9) for a reeent 
diseussion of this topie.). 

Effect of Multimodal Couplings on the Information 

In our diseussion above, we estimated the eognitive threshold for the most eomplex paths in the 
15 largest metropolitan networks. We now eonsider the effeet of ineluding other transportation 
modes (e.g., buses, trams, ete.). The effeets of inter-modal eoupling are signifieant (2, 31), and 
the natural framework is a multilayer network {33, 34), whieh assoeiates eaeh transportation 
mode with a different “layer” in a network and where interehanges (i.e., eonneetion points) 
between different modes are represented by inter-layer edges. As we diseuss in the Materials 
and Methods seetion, a major diffieulty eomes from obtaining data for different modes for a 
given eity, as this type of data is not eurrently available in general. However, we were able 
to eombine multiple data sourees for large eities on three different eontinents: New York City, 
Paris, and Tokyo (see Table 2). 

The distribution of S{s,t) is a superposition of peaks assoeiated to different values of C 
(see Fig. 6). By eomparing the distribution of S{s, t) for the bus monolayer network and the full 
multilayer transportation network, we ean distinguish two eompeting effeets of multimodality: 
(i) it tends to reduee the number C of eonneetions and thereby reduees S', and (ii) it inereases 
S, beeause new routes inerease the node degrees in the dual spaee. (See Fig. 7) However, these 
two eontributions do not eompensate eaeh other. In Fig. Q we show the eumulative distribution 
F{S{s, t)) of information entropy values for the New York, Paris, and Tokyo multimodal (i.e., 
multilayer) transportation systems. 

We find that less than approximately 17% (the maximum value obtained for Tokyo) of the 
trips are below the threshold S'max- These results imply that more than 80% of the trajeetories 
in the eomplete publie transportation networks of these major eities require more information 
than the most eomplieated trajeetory in the largest metro networks. As we show in the Supple¬ 
mentary Materials, the other 20% eorrespond to pairs of nodes for whieh the trip has essentially 
one eonneetion (for NYC) or at most two eonneetions (for Paris and Tokyo), as the simplest 
paths that earry a small amount of information are those that avoid using too many major hubs. 
(See Table 4 and Fig. 9). The number of eonneetions aeting as distraetor for the ease of the 
Paris MRT is already so large that it has a erueial impaet in the route seareh (it takes roughly 30 
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Figure 4: Information entropy of multilayer networks. The solid eurves represent the eu- 
mulative distributions of S{s,t) for multilayer networks that include a metro layer for New 
York City, Paris, and Tokyo. (We associate one layer to bus routes and another to metro lines.) 
Most of the trips require more information than the cognitive limit S'max ~ 8.1. The fraction 
of trips under this threshold are 15.6% for NYC, 10.7% for Paris, and 16.6% for Tokyo. The 
dashed curves are associated to all possible paths in a metro layer; in this case, the amount of 
information is always under the threshold, except for Paris, which includes trips with C = 3. 
(See Table 3). Note that the threshold value lies in a relatively stable part of all three cumulative 
distributions, which suggests that our results are robust with respect to small variations of the 
threshold. 

seconds on average for such as search (13)), and the complexity of the bus layer (and therefore 
of the coupled metro and bus system) will therefore exceed the human capacity. Consequently, 
traditional maps that represent all existing bus routes have a very limited utility. This result thus 
calls for the need of thinking about a user-friendly way to present and to use bus routes. For 
example, unwiring some bus-bus connections lowers the information, and leads to the idea that 
a design centered around the metro layer could be efficient. Further work is however needed to 
reach an efficient, ’optimal’ design from a user perspective. 

Finally, we note that one can also think of finding an optimal path by separately considering 
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different transportation modes. The price to pay for such a strategy would be to settle for 
suboptimal trajectories (which are neither simplest nor shortest), as an optimal trajectory is, by 
definition, computed on a full multilayer transportation network. 


Discussion 

Human cognitive capacity is limited, and cities and their transportation network have grown to 
the point that they have reached a level of complexity that is beyond human processing capabil¬ 
ity to navigate in them. In particular, the search for a simplest path becomes inefficient when 
multimodality is important and when a transportation system has too many interconnections. 
This occurs because of interconnections, which play two crucial roles in the search for a path: 
they are both targets and distractors. The identification of possible interchange points is a key 
and extremely time-consuming passage in the route-finding task (72). As with the case of hid¬ 
den objects (74), one can represent the difficulty of the search using the number of distractors, 
which for a map are all possible interchanges. We have found in the largest cities that the ad¬ 
dition of bus routes with maps that are already too complicated to be easily used by travelers 
implies that the cognitive limit to urban navigation is exceeded for multimodal transportation 
systems. 

We have estimated the cognitive threshold value to be of T ^ 250 connections (i.e., ap¬ 
proximately 8 bits), which represents the worst-case scenario for the most complex trips in the 
largest networks. This exceeds the behavioral transition, beyond which humans have difficulties 
to navigate on their own. 

Remarkably, we note that T has a similar interpretation and order of magnitude as the Dun¬ 
bar number, and our result can thus be construed as evidence to suggest the existence of a 
“transportation Dunbar number” for processing complex maps. Although the value of T rep¬ 
resents an upper bound, it allows us to demonstrate the existence of the huge gap between the 
amount of information that a human can process easily (Tf^tot < 250) and the amount that is 
contained in practice in multimodal transportation networks in large cities (Tf^tot > 1800, see 
Table 2). Indeed, the growth of transportation systems has yielded networks that are so en¬ 
tangled with each other and so complicated that a visual representation on a map becomes too 
complex and ultimately useless. In particular, our results imply that maximizing the number 
of intersections between lines, which indeed minimizes transfers, is contrary to the goal of 
having an easily usable transportation system. The information-technology tools provided by 
companies and transportation agencies that help people to navigate in transportation systems 
will soon become necessary in all large cities. Our analysis highlights the fact that humans 
need to integrate navigation complexity and seek new solutions that will help for navigation 
in megacities. Redesigning maps and representations of transportation networks (36) as well 
as improving information-technology tools (37) that help to decrease the amount information 
below the human processing threshold thus appear to be crucial for an efficient use of services 
provided by transportation agencies. 
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Materials and Methods 

Data for the World’s 15 Largest Metropolitan Transportation Networks 

The data that we use was extracted from Wikipedia (38). It describes the lines in the metropoli¬ 
tan networks as they were in 2009, and the data were used in a previous publication (3). The 
data for each metropolitan system yields a spatial network (39); each station has geographical 
coordinates, and the edges connect consecutive stations on a metro line. The diagnostic that we 
use for determining a shortest simplest path is the sum of the traveling times between stops. 

Paris 

The data that we use for constructing the Paris multilayer network (33, 34) come from the 
Regie Autonome des Transports Parisiens (40) and the Societe Nationale des Chemins de fer 
Fran 9 ais (41). Both data sets are provided in GTFS format. We extract the bus layer from 
the RATP data. To reproduce all of the information available in the Paris Metro Map (32), we 
merge the metro and tramway data from the RATP with the light rail and tramway data from 
the SNCF This aggregation gives what we call the “metro-rail-tramway” (MRT) layer, which 
we use as a single metro layer. We distinguish services that go in opposite directions or towards 
different branches, and we are able to identify them as a particular route because they share 
the same short name. We use the services that were available on Monday 26/5/2014, and we 
exclude any bus route that is completely subsumed by another route. 

The RATP data provide extremely detailed transfer times between stops. We use this in¬ 
formation to reconstruct connections between routes. With the exception of connections in the 
central station Chatelet-Les Halles, we ignore transfers that take longer than 8 minutes and con¬ 
sider the two corresponding nodes as different entities. By studying the cumulative distribution 
for the walking distances of these connections data, we find that the 99th percentile corresponds 
roughly to = 250m. More precisely, F(250m) ~ 0.986. Motivated by this calculation, we 
allow a maximum walking distance of t = 250m for the coarse-graining procedure that we 
need to construct the multilayer networks in New York City (NYC) and Tokyo. 

For Paris, the transfer data allow a more accurate method than for NYC and Tokyo. We con¬ 
struct a transfer network in which (i) a transfer is defined between two stops and (ii) two stops 
are located in the same geographical position. We associate a single node to each connected 
component of the transfer network, and each isolated stop constitutes a single node. We weight 
the edges using travel times. Relatively large connection areas emerge from our choices. They 
correctly reflect possible choices that are suggested to travelers on the Paris metro map (32) (via 
the representation of intra-station walking paths as white edges). 

Finally, we also couple nodes if (i) they are closer than = 250 meters and if they come 
from RATP data and are labeled with the same name or (ii) they come from SNCF data (where 
transfer times between lines were not provided). Intra-route connections are possible if routes 
share the same node. As for NYC, the trip length is equal to the sum of the mean travel times 
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between consecutive stops. 


New York City 

We construct the NYC multilayer network with data from the Metropolitan Transportation Au¬ 
thority (MTA) Data Feeds (42), which provide a snapshot of their service for different days 
using the General Transit Feed Specification (GTFS) {43). We use the services that were avail¬ 
able on Monday 1/12/2014. For the metro layer, we exclude six routes that represent shuttles or 
other special services. 

For the bus layer, we integrate the data from the five NYC boroughs with multi-borough 
data of the subsidiary MTA Bus Company. We perform a coarse-graining procedure in which 
(i) we couple bus stops to metro locations if they lie within a walking distance of = 250 
meters and then (ii) combine bus stops into a single entity if their distance is less than dyj. We 
therefore allow intra-route connections if two routes share the same stop (where coupled stops 
count as a single stop). As with Paris, the trip length is given by the sum of the mean travel 
times between consecutive stops. 

Tokyo 

The Tokyo metro data set, which we have extracted from Wikipedia, allows us to study another 
of the world’s largest metropolitan networks (3S). The bus data for Tokyo (for 2010) is provided 
freely by the Japanese Ministry of Land, Infrastructure, Transport, and Tourism (44). We use 
only Toei Bus lines (45), which serve central Tokyo. 

This data set associates stops to bus routes, but it gives no information about the topology of 
the line. We reconstruct an approximate topology by generating a minimal spanning tree (46) 
among all stops of a route. We define the intra-route connections using the same coarse-graining 
procedure as for NYC. We again combine nodes that are within a walking distance of 4^, = 
250 meters from each other. We have no information on the travel times along the edges, so 
we estimate them from the trip length and using typical transportation speeds for the bus and 
metro in the Paris data. Because one can approximate each layers’ speed by a log-normal 
distribution, we use the log-average v = exp (log(v)} as the typical speed associated to each 
mode of transport. The values that we find are vb ~ 14.0 km/h) for the bus and (vm ~ 23.4 
km/h) for the metro. 

Definitions of Paths and Information Entropy 

From the perspective of information processing, one can quantify the difficulty of navigating in 
an urban transportation network using a measure of “search information’’ S (16). This measure 
represents the amount of information that is needed for encoding a path from a route s to a 
route t that follows a simplest path. We define a simplest path as a shortest path in the dual 
space of a transportation network. In the present context, the dual space (which is also called 
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“P-space” (39)) is the network in whieh the nodes represent routes and the edges represent 
possible interseetions (or eonneetions between different lines) among those routes. There ean 
be many degenerate simplest paths between two routes, and one ean define the information 
entropy eomputed on these paths to be {16) 


S^{s,t) 


-logs 





(4) 


where the set {p(s,f)} ineludes all D{s,t) degenerate simplest paths between routes s and t. 
The quantity kg indieates the total number of eonneetions that emanate from route s. On a path, 
one has a ehoiee between kn — 1 routes (i.e., exeluding the route whenee the path emanates). 

When one is seeking an optimal trajeetory, the degeneraey is not neeessarily a signifieant 
faetor, and we foeus on a trip from a node i to another node j in real spaee. Among all of the 
degenerate simplest paths, we piek a shortest one p(s, t, i, j) and eonsider its entropy 


S{i,s; j, t) = log 2 (ks) + ^ log 2 (/c„ - 1). (5) 

n£{p(s,t,i,j)} 


Averaging over all nodes i G s and j G t yields the main quantity that we used in the main text: 


S{s,t) = {S{s,t,i,j)). 


( 6 ) 


In theory, it is also possible to weight this mean using real flows. Unfortunately, for this par- 
tieular ease, traditional data sourees are insuffleient, as they tend to deseribe the mobility of 
eommuters, who do not neeessarily rely on maps for their daily journeys. If we assume that all 
of the degenerate paths eontribute equally to the entropy, then we obtain the following approxi¬ 
mate relation between the entropies: 

S{s,t) ^ S^{s,t)+ \og 2 [D{s,t)]. (7) 

Deviations from the (approximate) equality Q indieate differenees between the various degen¬ 
erate paths (see Fig. 10). 

From a map user’s perspeetive, the existenee of several alternative simplest paths is not 
neeessarily a signifieant faetor, as one only needs a single simplest path for sueeessful trans¬ 
portation from origin to destination. The natural ehoiee is the fastest simplest path path, whieh 
is granted as unique for eontinuous travel-times. Consequently, we use the entropy in Eq. 
rather than the one proposed in Ref. {16) 
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City 

Abbreviation 

Nodes 

Edges 

N 


P-diameter 

New York City 

NY 

433 

497 

22 

161 

2 

Paris 

Pa 

299 

355 

16 

78 

3 

Tokyo 

To 

217 

262 

13 

56 

2 

London 

Lo 

266 

308 

11 

48 

2 

Madrid 

Ma 

209 

240 

12 

38 

2 

Bareelona 

Ba 

139 

165 

11 

37 

2 

Moseow 

Mo 

134 

156 

11 

35 

2 

Seoul 

Se 

420 

466 

12 

35 

3 

Shanghai 

Sh 

239 

264 

11 

35 

3 

Mexieo City 

MC 

147 

164 

11 

31 

2 

Berlin 

Ber 

170 

282 

10 

29 

2 

Chieago 

Ch 

167 

222 

8 

25 

2 

Osaka 

Os 

108 

123 

9 

24 

2 

Beijing 

Bei 

163 

176 

13 

21 

4 

Hong Kong 

HK 

84 

87 

10 

12 

4 


Table 1 : Network characteristics of the largest connected component for the 15 largest 
metropolitan systems in the world. The number of routes N and eonneetions K^ot, respee- 
tively, yield nodes and edges in the dual spaee. We list eities from most eonneetions to fewest 
eonneetions between different lines. The number of eonneetions Ktot is the key quantity from 
the perspeetive of information proeessing. (See the right panel of Fig. 2.) P-diameter indi- 
eates the network diameter in dual spaee. It is equal to 2 for 10 of the 15 networks, and one 
additionally obtains a value of 2 in Paris if one euts “3bis” (a four-stop line). 
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Network 

Nodes 

Edges 

N 

K,ot 

NYC Metro 

412 

512 

20 

162 

NYC Bus 

5306 

8435 

309 

6092 

NYC multilayer 

5332 

8804 

330 

8461 

Paris MRT 

629 

765 

28 

162 

Paris Bus 

3842 

5567 

111 

2708 

Paris multilayer 

4037 

6142 

305 

4292 

Tokyo Metro 

217 

262 

13 

56 

Tokyo Bus 

1359 

1663 

153 

1275 

Tokyo multilayer 

1422 

1908 

166 

1831 


Table 2: Network characteristics of the Bus-Metro multilayer networks. Similarly to Ta¬ 
ble 1, we show the number of routes N and connections Ktot, and the nodes and edges of the 
dual space. We note that there is a difference of an order of magnitude between the dimension of 
the Metro and the Bus layers, which represents a huge jump in complexity challenging people’s 
ability to navigate the multilayer transport networks. 
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C= 1 

C = 2 

(^ = 3 

NYC 

85.2% 

14.8% 

0% 

Paris MRT 

43.0% 

48.5% 

8.5% 

Tokyo 

72.8% 

28.2% 

0% 


Table 3: Structure of Simplest Paths in Three Metro Systems. We eompare the number of 
eonneetions in the simplest paths for the metro systems of the three megaeities (New York City, 
Paris, and Tokyo) that we eonsider in detail. Only Paris has paths with more than 2 eonneetions. 
A negligible fraetion (not displayed) of paths have 4 or 5 eonneetions. 



C=1 

C=2 

NYC 

99.4% 

0.5% 

Paris 

86.4% 

13.6% 

Tokyo 

79.9% 

20.1% 


Table 4: Length proportions of the paths with S' < 8.1 hits. For the three megaeities that we 
eonsider in this paper, about 20% of the trips have an information entropy that is lower than the 
threshold of 8.1 bits. Sueh trips predominantly have only a single eonneetion. When there are 
more, the starting route has a limited number of eonneetions (see Fig.|^. 
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Figure 5: Examples of paths with growing S{s, t). For the New York City multilayer network, 
we show examples with increasing complexity: S{s,t) ranges from 4 bits to 24 bits. We color 
the starting bus line s in blue and the destination line t in red. 
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Figure 6: Entropy distribution for MRT layer, bus layer, and complete multilayer network 
in Paris. For the multilayer network, we restrict the distribution to trips whose origin and 
destination are each in the bus layer. We see that the effect of multiplexity on the bus layer is to 
shift the peaks to the right, and we also obtain larger peaks for smaller values of C (see Fig. |^. 
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Figure 7: Effects of multiplexity. (Left) The values of S grow with C. The growth is larger for 
the multilayer network (in the sense that it is eharaeterized by the highest value of (k)), smaller 
for the bus layer (where the lines have fewer eonneetions), and is smallest for the MRT layer. 
(Right) Conversely, the mean path length is smaller for the bus monolayer network than for the 
multilayer network, in whieh the bus serviee internets with the (longer-range) lines in the MRT 
layer. 



N 


Figure 8: Growth of the Paris Metro Network. Letting the network grow with its historieal 
progression from line 1 to line 14, we see that the number of eonneetions in the dual spaee Tftot 
(blue dots) grows similarly to a lattiee (red line), whieh would have (iV/2)^ interseetions. 
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Figure 9: Dual-space degree of low-information starting points for paths with (7 = 2 in the 
Tokyo multilayer network. As one ean see in Table Q most of the trips below the eognitive 
limit have (7 = 1 conneetions. The trips below the eognitive threshold with (7 > 1 eonnections 
are eharaeterized by a low eonneetivity of the origin route. We show this feature for the Tokyo 
ease. We see that (7 = 2 for 20.1% of the trips that are below the threshold of 8.1 bits. For this 
fraetion of trips s —)■ f, the degrees k in the dual spaee of origin routes s (red squares) are small 
in eomparison to the degrees of all routes on the whole multilayer network (blue eireles). 
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Figure 10: Empirical validation of Eq. (7). Comparing the (left) bus monolayer network to 
the (right) multilayer network in Paris, we note that ineluding the metro-rail-tram (MRT) layer 
yields larger fluetuations. The mean square deviation is 0.34 bits for the bus layer and 0.82 bits 
of the multilayer network that contains both bus and MRT modes. This suggests that for the 
same route pair (s, t), different paths become optimal for different origins i and destinations j. 
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Figure 11: Information entropy of multilayer networks. This figure represents the probability 
density distributions of S{s, t). In Fig. 4 of the main text, we show the assoeiated eumulative 
distributions. Similar to Fig. 4, we assoeiate one layer to bus routes and another to metro lines. 
The solid eurves are assoeiated with multilayer networks that include a metro layer for New 
York City, Paris, and Tokyo. The dashed curves are associated to all possible paths in a metro 
layer. We observe that every distribution is characterized by a peak structure, and every peak is 
associated to a number C of connections. 
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